51 results found.
Language Type:
Multilingual
Languages:
Danish
Availability:
Freely Available
License:
<Not Specified>
Size:
5000 sentences Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
Danish
Availability:
From Owner
License:
Open Source
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
EnglishLanguage Type:
Multilingual
Languages:
Danish
Availability:
From Owner
License:
not decided
Size:
48000 Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
in progressLanguage Type:
Multilingual
Languages:
Danish Dutch Mandarin Chinese Standard Arabic
Availability:
Freely Available
License:
not yet known, presumably open source
Size:
400,000 multilingual named entities Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
RANLP'2011 paper
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Basque Bulgarian Danish Dutch English Estonian German Hungarian Irish Italian Portuguese Russian Serbian Slovenian Spanish
Availability:
Freely Available
License:
Size:
3 MByte Production Status:
Newly created-in progress
Use:
Lexicon Creation/Annotation
-
Paper title:A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sina Ahmadi | Monolingual Word Sense Alignment | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Danish
Availability:
Freely Available
License:
OpenSource
Size:
1.1M entries Production Status:
Newly created-finished
Use:
Summarisation
-
Paper title:DaNewsroom: A Large-scale Danish Summarisation Dataset
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Natalie Schluter | DaNewsroom | /N |
Documentation:
Yes, in English
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org
Speech/Written
List of resources,
Language Type:
Monolingual
Languages:
Danish
Availability:
License:
Size:
124 links to Danish language resources OtherProduction Status:
Existing-used
Use:
-
Paper title:World Class Language Technology - Developing a Language Technology Strategy for Danish
-
Paper track:Infrastructural Issues/Large Projects/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sabine Kirchmeier | Overview of Language Resources for Danish | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Arabic Bulgarian Catalan Croatian Czech Danish Dutch English Estonian Filipino Finnish French German Greek Hebrew Hindi Hungarian Indonesian Italian Japanese Korean Latvian Lithuanian Malay Norwegian Persian Polish Portuguese Romanian Russian Serbian Simplified Chinese Slovak Slovenian Spanish Swedish Thai Traditional Chinese Turkish Ukrainian Vietnamese
Availability:
Freely Available
License:
CC-BY-SA
Size:
60 GByte Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:Wiki-40B: Multilingual Language Model Dataset
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Rami Al-Rfou | Wiki40B-LM | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Afrikaans Albanian Arabic Armenian Bangla Basque Bosnian Breton Bulgarian Catalan Croatian Czech Danish Dutch English Esperanto Estonian Filipino Finnish French Galician Georgian German Greek Hebrew Hindi Hungarian Icelandic Indonesian Italian Japanese Kazakh Korean Latvian Lithuanian Macedonian Malay Malayalam Norwegian Persian Polish Portuguese Romanian Russian Serbian Sinhala Slovak Slovenian Spanish Swedish Tamil Telugu Thai Turkish Ukrainian Urdu Vietnamese pt_br ze_en ze_zh zh_cn zh_tw
Availability:
Freely Available
License:
<Not Specified>
Size:
22.10G tokens Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yo Joong Choe | OpenSubtitles2018 | /N |
Documentation:
Yes, on the website.




